Océ at TREC 2003
Authors
Abstract
This report describes the work done at Océ Research for TREC 2003. Our first participation consisted of ad hoc experiments for the Robust track. We ranked documents with the BM25 model and with our new probabilistic model. Knowledge Concepts’ Content Enabler semantic network was used for stemming and query expansion. Our main goal was to compare the BM25 model and the probabilistic model, each with and without query expansion. The generic probabilistic model we developed does not use global statistics of the document collection to rank documents: the relevance of a document to a given query is calculated from the term frequencies of the query terms in the document and the length of the document. We also did some theoretical research and constructed a model that uses relevance judgements from previous years; however, we did not implement it due to time constraints.
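For readers unfamiliar with the baseline, the following is a minimal sketch of textbook BM25 scoring, not the Océ implementation. The function name `bm25_scores`, the parameter defaults `k1=1.2` and `b=0.75`, and the non-negative `log(1 + ...)` form of the inverse document frequency are assumptions chosen for illustration. It shows where BM25 relies on collection-wide statistics (document frequencies and average document length), which is the dependency the abstract says the new probabilistic model avoids.

```python
import math
from collections import Counter


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each tokenized document in `docs` against the tokenized `query`.

    Illustrative textbook BM25; k1 and b are conventional defaults (assumed).
    """
    n_docs = len(docs)
    # Global statistic 1: average document length over the collection.
    avg_len = sum(len(d) for d in docs) / n_docs
    # Global statistic 2: number of documents containing each term.
    df = Counter(term for d in docs for term in set(d))

    scores = []
    for d in docs:
        tf = Counter(d)  # term frequencies within this document
        score = 0.0
        for term in query:
            if tf[term] == 0:
                continue
            # Non-negative IDF variant (assumed for this sketch).
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Length-normalized term-frequency saturation.
            norm = tf[term] + k1 * (1 - b + b * len(d) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    ["robust", "retrieval", "track"],
    ["query", "expansion"],
    ["robust", "robust", "model"],
]
scores = bm25_scores(["robust"], docs)
```

In this toy collection the second document does not contain the query term and scores zero, while the third document, which has the same length as the first but repeats the term, scores higher than the first.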
Similar resources
Océ at CLEF 2003
This report describes the work done at Océ Research for the Cross-Language Evaluation Forum (CLEF) 2003. This year we participated in seven mono-lingual tasks (all languages except Russian). We used the BM25 model, a probabilistic and (for Dutch only) a statistical model to rank documents. Knowledge Concepts’ Content Enabler semantic network was used (for statistical model only) for stemming, a...
Using MT in a Corporate Setting
Company introduction: Océ Technologies is a manufacturer of plotters, printers, design & engineering equipment and supplies. Océ has operating companies in 30 countries and is active in 80 countries. About 17,000 people are employed worldwide; 3000 are based at the head office in Venlo, The Netherlands. Implementation of MT: After prototyping an MT system at Océ R&D to demonstrate its feasibility...
Overview of TREC 2003
The twelfth Text REtrieval Conference, TREC 2003, was held at the National Institute of Standards and Technology (NIST) November 18–21, 2003. The conference was co-sponsored by NIST, the US Department of Defense Advanced Research and Development Activity (ARDA), and the Defense Advanced Research Projects Agency (DARPA). TREC 2003 is the latest in a series of workshops designed to foster researc...
Combining First and Second Order Features in the TREC 2003 Robust Track
This year at TREC 2003 we participated in the robust track and investigated the use of very simple retrieval rules based on convex combinations of similarity measures based on first and second order features.
Publication date: 2003